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OPTIMAL  PRICING,  USE  AND  EXPLORATION  OF  UNCERTAIN  NATURAL  RESOURCE  STOCKS 

by 

Kenneth  J.  Arrow  and  Sheldon  Chang— ^ 


Units  of  natural  resources,  to  be  called  "mines", 
are  assumed  distributed  over  the  unexplored  territory 
according  to  a Poisson  process  in  space.  At  any 
moment,  total  reserves,  R,  and  unexplored  land,  X,  are 
given.  Society  can  determine  the  rate  of  consumption, 
c,  and  the  rate  of  exploration  (space  per  unit  time),  x. 
Reserves,  R,  are  drawn  down  by  consumption  and  increased 
by  discoveries  made  during  exploration;  X is  decreased 
by  exploration.  At  any  moment,  the  payoff  to  society  is 
a concave  function  of  c less  a linear  function  of  x. 
Payoffs  in  the  future  are  discounted  back  to  the  present 
at  a constant  rate. 

The  optimal  policy  is  characterized  by  a decreasing 
function,  Rg(X).  When  R > Rg(X),  there  is  no  exploration. 
The  shadow  prices  of  reserves  and  of  unexplored  land 
both  rise  at  the  rate  of  interest,  and  consumption  is  such 
that  its  marginal  utility  equals  the  shadow  price  of 
reserves.  When  R decreases  to  Rg(X),  exploration  occurs 
at  an  infinite  rate;  R is  increased  by  random  discoveries 
and  X is  decreased  by  the  area  explored,  so  that  R and  X 
are  transformed  into  R'  and  X' , respectively,  with 
R'  > Rg(X').  For  large  values  of  X,  Rg(X)  is  almost  zero, 
and  the  shadow  prices  move  in  random  cycles  but  show  only 
a slight  upward  trend,  thereby  casting  some  light  on  the 
failure  of  mineral  prices  to  rise  at  the  market  rate  of 
interest. 


The  classic  Hotelling  [1931 ] model  of  exploitation  of  exhaustible 
resources  assumes  in  its  simplest  form  that  the  stock  of  the  resource 
is  known  from  the  beginning.  If  there  are  no  extraction  costs,  then  the 
shadow  prices  associated  with  an  optimal  extraction  policy  rise  at  the 
rate  of  the  market  rate  of  interest.  The  only  variable  that  has  to  be 
determined  is  the  initial  price,  which  then  determines. all  future  prices; 
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this  depends  upon  the  interaction  of  demand  (or  utility)  considerations 
with  the  initial  stock.  Clearly  in  a competitive  world,  prices  would  have 
to  rise  at  the  rate  of  interest  to  keep  resource-holders  indifferent 
between  extracting  the  resource  now  and  later. 

If  there  are  extraction  costs,  then  the  market  price  is  the  sum 
of  the  marginal  extraction  cost  and  the  rent  on  the  scarce  stock.  The 
latter  still  obeys  the  rule  of  the  preceding  paragraph,  so  that  the  rent 
will  rise  at  the  rate  of  interest. 

The  experience  of  many  minerals , most  strikingly  oil  from  1950  to 
1970,  is  that  the  theoretically-derived  increase  in  prices  is  not  observed. 
One  might  take  a very  long-run  view  of  the  world  and  say  that  the  post-1970 
increases  are  a sort  of  making  up  for  lost  time.  Perhaps  the  Hotelling 
theory  looks  better  when  1977  is  compared  with  1950.  Still  such  an  evasion 
gives  the  theory  little  value. 

No  doubt  many  explanations  for  the  discrepancy  are  possible . But 
certainly  one  rather  obvious  one  is  that  the  stock  of  the  resource  is 
very  far  from  known.  Hence,  new  discoveries  yielding  upward  revision 
of  estimated  reserves  change  the  basis  of  calculation  for  the  Hotelling 
rents.  The  predicted  rise  at  the  rate  of  interest  is  offset  by  repeated 
downward  revision  of  the  initial  price  in  response  to  changing  supply 
estimates. 

But  if  the  stocks  are  in  fact  uncertain,  as  evidence  by  repeated 


changes  in  estimates,  then  that  uncertainty  should  be  reflected  in  the 
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initial  planning.  It  is  the  aim  of  this  paper  to  begin  such  an  analysis.—' 


The  analysis  of  extraction  and  consumption  policy  under  conditions  of 
uncertainty  about  the  amount  of  reserves  has  been  studied  a number  of 


times  in  the  literature.  For  a survey,  see  Crabbe  [1977], 
recent  example,  see  Loury  [1976]. 


I 


It  is  important,  in  ny  Judgment,  to  distinguish  between  the  process 
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of  learning  about  reserves  and  the  consumption  of  them.—'  The  first 
constitutes  exploratory  activity.  It  is  clearly  not  the  case  that  the 
only  form  of  information  about  resource  reserves  comes  from  pumping  for 
consumption. 

The  analysis  in  this  paper  is  largely  heuristic,  and  some  points 
remain  to  be  clarified,  as  will  be  seen. 


1.  The  Model  of  Uncertainty  About  Resources 

Resources  are  assumed  to  be  distributed  randomly  throughout  the 
relevant  area;  they  can  discovered  by  costly  exploration. 

In  particular,  it  will  be  assumed  that  there  is  no  spatial  correla- 
tion of  resource  distribution.  In  probability  terms,  the  resource  quan- 
tities in  two  overlapping  areas  are  assumed  to  be  independent  random 
4/ 

variables .-y  It  turns  that  this  assumption , together  with  the  natural 
condition  that  resource  pools  are  non-negative,  severely  restricts  the 
range  of  possible  distribution  to  be  used.  In  particular,  the  resources 
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— Pindyck  [1978]  has  distinguished  between  exploration  and  extraction 
but  not  in  the  context  of  uncertainty.  Gilbert  [1978]  has  introduced 
exploration  as  a means  of  gaining  information,  as  in  the  present 
paper,  but  his  probability  assumptions  are  different.  Like  Loury 
[1976],  he  assumes  a given  probability  distribution  for  the  total 
stock;  extraction  and  exploration  change  the  distribution  only 
in  the  sense  that  the  posterior  distribution  is  conditioned  on  the 
statement  that  the  total  is  at  least  as  great  as  the  amount  extracted 
and  explored  for.  This  assumption  is  not  grounded  on  the  chance 
distribution  of  resources  over  the  surface  of  the  Earth. 

The  model  of  Deshmukh  and  Pliska  [1978],  developed  independently 
of  the  present  paper,  has  fundamentally  the  same  probability  assumptions 
(strictly  speaking,  theirs  are  more  general).  However,  they  assume 
that  there  is  no  limit  on  unexplored  land  and  hence  in  the  long  run 
the  resource  is  not  exhaustible. 

V Some  support  for  this  view  can  be  found  in  the  model  of  Menard 
and  Sherman  ([1975],  p.  337). 
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must  be  located  in  discrete  spots,  randomly  distributed  over  the  area. 

If  the  mines  are  equally  likely  to  be  found  in  all  parts  of  the  area,  then 
two  natural  assumptions  are  that  the  probability  of  finding  a mine  in  an 
area  h is,  for  small  h,  proportional  to  h,  while  the  probability  of 
finding  two  or  more  mines  in  a small  area  goes  to  zero  more  rapidly  theui 
h.  In  symbols. 


x(t)  = rate  of  exploratory  effort  at  time  t 

Then  in  the  interval,  (t,  t + dt),  for  small  dt,  the  area  explored 
is  x(t)dt.  If  we  let  h = x(t)dt,  we  are  assuming, 

(l)  Prob  (discovering  one  mine  between  t and  t + dt)  = Xx(t)dt  . 

The  probability  of  discovering  two  or  more  mines  in  this  small  interval 
is  infinitesimal  compared  with  (l)  and  can  be  disregarded. 
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The  learning  in  this  model  is  simple  but  not  non-existent.  It  is 
true  that  exploration  in  any  area  gives  no  information  about  resources 
outside  that  area.  However,  the  unexplored  area  is  diminishing;  the 
reserves  are  the  sum  of  those  known  to  be  in  the  explored  area  plus  a ran- 
dom variable  representing  reserves  in  the  unexplored  area,  but  the  variance 
of  the  latter  component  is  diminishing  as  the  unexplored  area  diminishes. 
Let, 

x(t)  = area  unexplored  at  time  t. 

By  definition, 

(2)  X(t)  = -x(t)  , 

where  the  dot  denotes  differentiation  with  respect  to  time,  X(o)  is 
given  and  ^(t)  is  restrained  to  be  non-negative;  presumably,  x (t) 
approaches  zero  as  t increases . 

We  make  now  one  central  simplifying  assumption : all  mines  have 
the  same  resource  quantities.  This  could  be  generalized  by  assuming 
that  there  is  a distribution  of  mine  sizes,  for  example,  logarithmic 
normal.  However,  I believe  that  this  generalization  can  be  accommodated 
with  little  change  in  the  following  analysis.  Without  loss  of  generality, 
the  amount  of  resource  in  each  mine  will  be  taken  as  1. 

Let, 

R(t)  * known  reserves  at  time  t , 

H(t)  * number  of  mines  discovered  between  0 and  t , 

c(t)  3 rate  of  consumption  of  the  resource  at  time  t 


t 

/ c(t)dt  . 
o 


The  known  reserves  at  time  t are  the  reserves  at  time  0 less  consump- 
tion up  to  time  t plus  the  number  of  mines  discovered  between  0 and  t. 


t 

(3)  R(t)  = R(0)  - / c(t)dt  + N(t) 

o 

The  variable  N(t)  is  a random  variable;  it  jumps  up  by  one  unit  in  any 
small  interval,  (t,  t + dt)  with  a probability  given  by  (l).  Therefore 
R(t)  is  also  a random  variable.  From  (3). 

(U)  R(t)  * -c(t)  if  t is  between  Junq?  points  of  N(t) 

R(t)  is  constrained  to  be  non-negative  everywhere. 


2.  The  Criterion  Function 

The  maximand  will  be  the  usual  integral  of  discounted  utilities. 

Here,  utility  at  any  point  of  time  depends  on  both  the  amount  of  the 
resource  being  discussed  and  on  "goods  in  general."  Since  even  the  most 
; important  mineral  requires  a relatively  small  fraction  of  national  income 

for  production,  it  is  reasonable  to  assume  the  absence  of  income  effects, 
so  that  utility  is  linear  in  goods-in-general . It  could,  therefore,  be 
ignored  in  analysis  except  that  we  will  regard  exploration  as  a costly 
activity.  The  inputs  to  exploration  will  be  assumed  to  be  goods  in  general. 
Let, 


P = price  of  exploration  in  terms  of  goods-in-general 


U(c)  - utility  of  consumption  of  resource  at  any  instant 


TOie  units  of  measurement  of  U can  be  taken  to  "be  goods- in- general;  hence, 
the  utility  generated  at  any  instant  is  U(c)  - Px,  vhen  the  consumption 
is  c and  the  rate  of  exploration  is  x. 

The  payoff  to  the  economy  is, 

« 

(5)  /e"pt[U(c(t))  - Px(t)]dt  , 
o 

vhere 

p = rate  of  time  discount  of  utilities 

However,  since  constraint  (3)  involves  a random  variable,  the 
payoff  (5)  will  be  a random  variable,  since  future  consumption  and 
exploration  will  depend  upon  the  random  discoveries  made  in  the  intervening 
period.  The  economy  will  therefore  seek  to  maximize, 

OD 

(6)  E{/e'pt[U(c(t))  - Px(t)]dt}  . 

o 

Criterion  (6)  is  to  be  maximized  with  respect  to  the  instruments, 
c(t),  x(t)  subject  to  the  constraints  that  X(t)  > 0,  R(t)  > 0 every- 
where, with  X(t)  and  R(t)  being  defined  by  (2)  and  (3)  respectively. 

✓ 

3.  Analysis  of  the  Optimal  Policy 

The  methods  are  those  of  dynamic  programming;  see  Bellman  and 
Dreyfus  [1962]  for  a heuristic  exposition. 

We  consider  the  maximum  value  of  (6)  (under  constraints)  as  a 
function  of  the  initial  values  of  the  two  state  variables,  R and  X. 


(7)  V(E,  X)  = max  E{  / e-pt[U( c(t) ) - Px(t)]dt}  when  R(0)  - R,  X(0)  = X. 

o 


Because  time  enters  explicitly  only  through  the  exponential  dis- 
count rate,  the  optimum  starting  from  t > 0 and  discounted  hack  to 

t (rather  than  zero)  would  he  the  same  function  of  R(t  )»  X(t  ) as 
o o o 

given  hy  (7).  Hence,  V(R,x)  is  the  sum  of  the  integral  over  the  interval 

r 

(0,t  ) and  the  expected  value  of  V at  time  t , discounted  hack  to 
o o 

time  0. 

*o  . -pt 

(8)  V(R,x)  = /e"pt[u(c(t))  - Px(t)]  + e 0E{V(R(tQ) , x(tQ) ) } . 

o 

Note  that  R(t  ) is  a random  variables,  because  its  value  depends  on  the 
o 

number  of  discoveries  in  the  interval  (0,tQ). 

Equation  (8)  is  valid  if  c(t)  and  x(t)  have  been  chosen  opti- 
mally. Alternatively,  we  may  consider  c(t)  and  x(t)  to  be  variables 

over  the  interval  (0,t  ) and  optimize  over  them,  on  the  assumption  that 

o 

V(R(t  ),  X(t  ))  is  itself  calculated  on  the  assumption  that  an  optimal 
o o 

policy  is  followed  thereafter.  Notice  that  R(tQ)  and  X(tQ)  both  depend 

on  the  choice  of  instruments  over  (0,t  ),  and  hence  the  optimization  must 

o 

take  account  of  the  effects  on  both  terms . 

Now  assume  that  t = dt,  a number  sufficiently  small  that  c and 


x may  be  regarded  as  constant  over  (0,dt).  We  can  also  disregard  the 
discounting  within  this  interval.  Hence,  the  first  term  in  (8)  can  he 
approximated  hy. 
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The  factor  e pdt  is  approximately  1 - pdt.  From  (3),  R(dt)  is  a random 
variable.  With  c treated  as  constant, 

R(dt)  = R(0)  - cdt  + N(dt)  , 

where  the  symbol,  means  "approximately  equal."  But  in  the  small 

interval  (0,  dt),  the  probability  of  more  than  one  mine  being  discovered 
is  negligible.  From  (l),  N(dt)  = 1 with  probability  approximately 
Axdt,  and  = 0 with  probability  1 - Xxdt.  Also,  X(dt)  :X  (0)  - xdt. 
Therefore, 


(9)  E{V(R( it) , x(dt) ) } : (1  - Xxdt)V(R(o)  - cdt,  x(0)  - xdt) 

+ XxdtV(R(0)  + 1 - cdt,  X(0)  - xdt) 


Assume  in  addition  that  V is  a differentiable  function  of  R 

and  X.  Let  V°  = 3V/3R  evaluated  at  (R(0),  X(o)),  V°  = 3V/3  X evaluated 
K X 

at  the  same  point,  and  and  Vd  the  two  partial  derivatives  evaluated 

R X 

at  <R(0)  + 1,  X(0)).  Also,  note  that  R(o)  = R,  x(0)  = x. 


V(R(0)  - cdt,  X(0)  - xdt)  = V(R,X)  - V°cdt  - V°:xdt  , 

K X 


V(R(0)  + 1 - cdt,  X(o)  - xdt)  r V(R  + 1,  X)  - Vdcdt  - Vjjxdt 


Substitution  into  (9),  some  rearrangement, and  discarding  terms  in  (dt)  yield. 


E{V(R( dt) , X(dt))}  - V(R,X)  - V°cdt  - V^xdt  + AxdtAV(R,x) 


(The  operator,  means  the  difference  between  the  function  at  R + 1 and 
at  R,  X being  held  constant.) 


If  we  substitute  this  and  the  other  results  into  (8),  subtract 
(1  - pdt)  V(R,  X)  from  both  sides,  divide  through  by  dt,  and  let  dt  approach 


10  - 


zero,  we  find, 

(10)  PV(R,X)  = max  {U(e)  - Px  - V°c  - V°x  + XxAV}  . 

c,x  R X 

Equation  (10)  will  not  hold  throughout  the  (R,  X)-plane.  Note 
that  no  upper  bound  has  been  set  to  the  exploration  rate  and  that 
exploration  costs  are  linear.  Therefore,  it  is  possible  and,  as  will 
seen  shortly,  true  that  there  will  be  moments  of  infinitely  rapid 
exploration.  We  may  think  of  a whole  area  being  explored 
with  a corresponding  cost  which  is  finite  in  total  but  incurred  at  an 
infinite  rate.  In  such  a case,  there  is  an  instantaneous  downward  jump 
in  the  state  variable  X (unexplored  land).  Equation  (10)  will  not  hold 
in  the  region  of  infinitely  rapid  exploration. 

Now  maximize  (10)  with  respect  to  c and  x,  the  instrument  values 
at  time  0.  For  the  consumption  rate,  c,  ignore  the  corner  possibility  c=0;  this 
is  certainly  legitimate  if  U'(0)  = +00,  i.e.,  if  the  resource  is 
dispensable. 

(11)  U'(c)  = V°  . 

The  maximand  in  (10)  is  linear  in  x. — — 

One  cannot  exclude  the  possibility  of  either  a corner  solution,  x = 0,  or 

of  an  infinitely  rapid  exploration;  indeed,  one  of  these  two  cases  will 

hold  almost  everywhere.  Clearly,  from  (10),  the  rate  of  exploration  (x) 

would  be  infinite  if, 

P + V°  < X AV  , 

A 

and  would  be  zero  if , 


P + V°  > X A V • 
A 
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Clearly,  V°  and  vS  are  the  shadov  rents  of  the  stock  of  reserves 
and  the  stock  of  unexplored  land,  respectively.  We  can  drop  the  super- 
script 0,  since  the  analysis  holds  at  any  time  t.  Let, 

p_  = rent  of  reserves  = V - , 

R R 

p = rent  of  unexplc land  = V 
X X 

Then  (ll  ) can  be  rewritten, 

(11')  U'(c)-pR  , 

Let, 

(12)  Am  = { (R,  X)  | P + px  < UV  }, 

(13)  Aq  = { (R,  X)  | P + px  > X AV  }, 

(14)  B = { (R,  X)  | P + px  = A AV  }, 

Then,  from  the  earlier  remarks, 

(12')  exploration  is  infinitely  rapid  for  (R,  X)  e A , 

oo 

(13’)  x = 0 for  (R,  X)  e A . 

o 

When  (R.  X)  e B,  it  is  so  far  possible  that  there  will  be  exploration 
at  a positive  but  finite  rate;  in  any  case,  B is  the  boundary  between  the 

two  regions  A and  A . 

o «> 

Equation  (11')  is  familiar;  the  shadow  price  of  consumption  must 
equal  the  shadow  price  of  holding  reserves.  Relations  (12’ -13')  are  more 
novel.  The  left-hand  sides  in  the  definitions  (12-14)  represent  the  costs 
of  exploration,  direct  costs  and  the  use  of  unexplored  land,  which  has  a 
value  due  to  its  potential  for  resources.  The  right-hand  sides  represent 
the  benefits,  the  gain  in  total  utility  (a  finite  increment,  not  a rate 
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of  change)  due  to  finding  a mine  multiplied  by  the  probability  of  finding 
one  per  unit  effort. 

We  first  investigate  the  structure  of  the  optimal  policy  and  of 
the  return  function , V,  in  region  Aq.  Here,  x = 0,  c is  given  by  (11'), 
and, 

pV  = U (c)  - VR  c, 

from  (10).  Differentiate  with  respect  to  R and  with  respect  to  X;  since 
c maximizes  the  right-hand  side,  it  can  be  regarded  as  constant  in 
accordance  with  the  "envelope  theorem." 

<15)  PVR  " ‘VRRC’  pVX  " - VRX  C’ 


Divide  the  first  equation  by  the  second. 


VVX  ‘ VRR/V 


EX' 


This  partial  differential  equation  holds  throughout  the  domain  Aq.  As  is 
well-known,  it  is  equivalent  to  the  statement  that  there  exist  functions 
V and  Rg  such  that. 


(16)  V (R,  X)  = V[R  + RgW]  for  (R,  X)  e Aq. 


Without  loss  of  generality,  we  may  set 
(17)  Re(0)  = 0. 

Note  that  RgOO  may  be  regarded  as  the  resource-equivalent  value  of 
unexplored  lands. 

The  importance  of  this  result  is  that  the  function  V is  the  return 
function  for  the  optimal  solution  in  the  case  where  there  are  no  unexplored 
lands,  as  can  be  seen  by  setting  X = 0 in  (16).  This  is  a standard 
problem  which  is  easily  solved.  Thus,  in  the  case  where  U(c)  = In  c,  it 
is  easy  to  see  that, 

V(R)  = (In  p - 1 + In  R)/  p. 
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Then,  in  the  domain  Aq,  the  instrument  variables,  c and  x,  are  determined 


by  the  relations,  x = 0,  and,  from  (11'), 
O’  (c)  * VR  = V'  [R  + Rgtt)]. 


The  movements  of  the  shadow  prices  in  A can  also  be  established 

o 


easily.  Note  that. 


VRR  * V"[R  + VX)1‘ 


Then, 


1 dpR 


PR  dt 


1 dVR  1 

VR  dt  VR  VRR  K’ 


i 


since  there  is  no  exploration,  R * - c,  by  (4).  Then  by  the  first  equation 
in  (15), 

(18)  PR/pR  = p £or  (R»  e A0* 

as  in  the  usual  Hotelling  theory.  From  (16), 


(19)  px  = Vx  = V'  R^.(X); 

hence  the  ratio  p /p  depends  only  on  X and  is  therefore  constant  along  any 
given  path.  Hence,  we  must  also  have, 

(20)  Px^pX  = P for  ^R’  E Ao* 

This  regime  continues,  with  R decreasing,  until  the  trajectory  hits 
B.  Note  that  the  behavior  has  been  defined  up  to  an  as  yet  unknown 
function,  Rg. 

Now  let  us  consider  the  determination  of  the  return  function, 

V (R,  X),  in  A*,.  Suppose  the  initial  situation  is  (R,  X + dX).  Exploration 
occurs  at  an  infinite  rate,  so  consumption  can  be  disregarded.  When 
the  area  dX  is  explored,  the  probability  of  a discovery  is  AdX  to  a first 
approximation.  If  a discovery  is  made,  the  point,  (R,  X + dX)  is  trans- 
formed into  (R+l,  X);  if  not,  it  is  transformed  into  (R,  X).  The  cost 
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incurred  is  P dX.  Hence, 

V (R,  X + dX)  = (X  dX)  V (R+l , X)  + (1  - X dX)  V(R,  X)  - P dX. 
Dividing  through  by  dX  and  letting  dX  approach  0 yields, 

(21)  VX(R,  X)  = X AV  (R,  x)-P  for  (R,  X)  e A^. 

Let  the  boundary  B be  represented  by  the  curve,  R * R^X),  so  that, 

(22)  R >Rg(X)  for  (R,  X)  £ Aq,  R < Rfi(X)  for  (R,  X)  £ A^. 

We  wish  to  determine  Rfi(X).  Since  the  point  (R^(X),  X)  is  on  the 
boundary  of  both  Aq  and  A^,  it  follows  by  continuity  that  is  given  both 
by  (19)  and  (21). 

(23)  V’[Rb(X)  + RE(X)]  R^,(X)  = XAV  [Rg(X),  X]  - P. 

For  a path  starting  from  some  (R,  X),  R > Rg(X),  V(R,  X)  = V[R  + Rg(X)] 

is  maximized.  If  R^  is  given  up  to  and  including 
X,  it  follows  that  we  wish  to  maximize  R^,(X),  since  this  act  maximizes 
Rj,(X  + dX).  From  (23),  then, 

(24)  R^(X)  = max  [XAV  (r,  x)  - P]/V'  [r  + R£(X)], 

r 

(25)  Rg(X)  is  the  value  of  r which  achieves  the  maximum  in  (24). 

These  equations,  together  with  (21)  and  (16)  completely  describe  the 
solution.  Start  with  V(R,  0)  = V(R),  a known  function.  Having  defined 
the  functions,  V(R,  X),  Rg(X)  UP  to  some  P°*nt  X for  R»  we 

and  R^(X)  from  (24)  and  (25).  We  can  then 

define  V(R,  X + dX)  from  (21)  for  R < Rfi  (X),  R£(X  + dX)  ■=  R£(X)  + R^,(X)  dX, 
and  therefore  V(R,  X+dX)  = V[R  + RE(X+dX)]. 

One  last  Important  point.  It  can  be  shown  that  R^  is  a decreasing 
function.  Suppose,  to  the  contrary,  that  it  were  increasing  in  some 
interval.  Start  with  a point  (R,  X+dX)  e B,  so  that  R * R^X  + dX).  If 
exploration  over  dX  produces  no  discovery,  then  we  arrive  at  the  point 


can  calculate  Rg(X) 


is  maximized  when  R^ (X) 
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(R,  X);  but  R ■ Rg(X  + dX)  > Rg(X),  so  that  (R,  X)  e A^.  If  a discovery 
is  made,  then  we  arrive  at  (R+l,  X),  which  certainly  belongs  to  Aq.  In 
that  case, 

AV(R,  X)  - V[R  + 1 + RgCX)]  - V[R  + Rg(X)]. 

In  the  maximization  in  (24),  R£(X)  is  taken  as  given.  Hence, 
maximizing  over  r is  equivalent  to  maximizing  over  s = r + Rg(X) . There- 
fore, (24)  becomes. 


and. 


R^.(X)  = max  [X  AV(s)  - P]  /V’(s). 
s 


Rg(x)  = s*  - Rg(X),  where  s*  achieves  the  maximization  above. 

But  the  maximand  is  now  independent  of  X,  so  that  s*  is  independent  of  X. 
Since  Rg(X)  is  certainly  increasing  (an  increase  in  unexplored  lands  can 
certainly  never  decrease  total  return),  R„(X)  must  be  decreasing,  a contra- 

O 

diction  to  the  original  assumption  that  Rfi  is  Increasing. 

That  Rg  is  decreasing  is  a bit  surprising  at  first.  However,  suppose 
there  is  an  infinite  amount  of  unexplored  territory,  the  case  studied  by 
Deshmukh  and  Pliska  [1978].  Under  our  special  assumptions  (linear  costs  of 
exploration),  it  is  easy  to  see  that  the  optimal  path  starting  with  some 
reserves  R is  to  exhaust  them  along  an  optimal  path  (with  prices  rising 
at  the  rate  p).  When  R = 0,  then  exploration  takes  place  at  an  infinite 
rate  until  a discovery  is  made.  There  is  no  reason  to  explore  earlier; 
there  is  no  information  gained,  since  a discovery  will  always  be  made  with 
probability  1,  and,  in  view  of  discounting,  it  pays  to  postpone  exploration 
until  the  last  possible  moment.  When  the  discovery  is  made,  exploration 
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In  this  case,  the  price  rises  at  the  rate  p until  exhaustion  but 

drops  abruptly  upon  discovery.  Over  a long  period  of  time,  price  has 

no  trend,  though  it  varies  periodically. 

Ue  may  assume  that  in  our  model,  with  large  X,  the  effect  is 

approximately  the  same  as  infinite  X.  Clearly,  with  U' (0)  * +*,  one  would 

not  proceed  to  absolute  exhaustion!  with  a finite  X,  there  is  always  a 

finite  probability  that  there  are  no  discoveries  to  be  made.  But  with 

-XX 

large  X,  the  probability  of  that  event,  e , is  very  small,  so  that 
RgCX)  would  be  expected  to  be  very  small.  On  the  contrary,  with  small  X, 
information  about  the  remaining  reserves  becomes  more  important,  so  that 
exploration  takes  place  at  a higher  level  of  reserves. 

The  price  history  will  show  fluctuations  with  little  upward  trend 
when  X is  large;  presumably  the  upward  trend  is  stronger  as  X approaches 
zero,  but  this  requires  a probabilistic  analysis  not  yet  performed. 

4.  Commentary. 

The  alternation  of  zero  and  infinite  rates  of  exploration  is  an 
unfelicitlou8  feature  of  the  model.  It  is  due  to  the  assumption  that 
exploration  costs  are  flows  and  can  be  turned  on  and  off  without  cost. 

If  one  added  to  the  model  the  need  for  capital  invested  in  exploration 
and  production,  there  would  be  a tendency  to  smooth  out  exploration  activities 
in  order  to  make  better  use  of  the  capital.  But  this  would  convert  the 
problem  into  one  with  three  state  variables,  which  would  be  even  more 


difficult 
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